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ABSTRACT 

The problem of determining extreme percentiles of a distribution 
function by methods based on the Robbins-Monro procedure is examined 
by computer simulation. Several techniques for improving the 
A stopping rule 


convergence properties are proposed and studied. 


based on binomial confidence limits is examined. 
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A. The Problem 

Consider a problem in sensitivity testing where items are selected 
at random from a population and subjected to a known stimulus level x. 
The actual, critical stimulus level of each item, X, is a random variable 
whose distribution My ( x ) is the distribution of the critical stimulus 
levels in the population. X cannot be directly observed but information 
about it is gained by observing the response or non-response of each 
item when the known level x is applied. The magnitude of x may vary 
from item to item in the sample and may depend, at each trial, upon 
previous observed outcomes. Assume, also, that each item within the 
sample is subjected to only one magnitude x and is tested only one time. 

Let Y( x ) be a random function such that 

lifX £ x (response occurs) 
YC) es 
O:if XSi (non-response occurs) 

Therefore, to each value x there corresponds a random variable Y(€ x _ ) 
with a distribution function P{ x x ) y | such that if pf Y(x )< ‘ 
= Gy(y/ x ) then My( x) = f yacye y/x)=P(XS x) =E & (x) J 

Lie ROG i Ge ae lee Me given constant such that the equation 
Mx( x ) = © has a unique but unknown root, © , i-e., My( @ ) = Oo 
We are interested in estimating this root where eo is the proportion of 
the population which will respond to stimulus level 6 

Parametric and non-parametric methods exist for the class of problems 
outlined above. However, known methods have not been found practical in 
the case of sensitivity testing for an extreme percentile of a distribution. 


The parametric methods such as (5) currently used by NOL fail because of 


the drastic limitations the assumed distribution imposes on the tail of 





the function My( x ). The non-parametric methods seem to require extremely 
large sample sizes and lack a workable stopping rule, i.e., a technique 
to tell us when to stop sampling while giving us some "measure of close- 
ness" to the true © point. 

Among the most popular of the non-parametric methods is the Robbins- 
Monro stochastic approximation method. The purpose of this paper will be 
to computationally, rather than analytically, investigate this method in 
an effort to: first, find ways of improving its convergence; second, 


outline the method for a stopping rule. 








5. Background 
The non-parametric stochastic approximation method suggested by 


Robbins and Monro (1) involved selecting a sequence of positive constants 


an , n= 1, such that 
co oo 
(2) > a, = oF and ~ a* < co and constructing a non- 
n=1 n=1 


stationary Markov chain Ln f to serve as the approximation of © where 


(3) x, = *n-1 7 @n ike (x, 1) -ox] , n= 1, and the zeroth 


approximation, x is an arbitrary gures for =, They proved that 


°°? 
this sequence of point estimates of © will converge to the true © 
point in probability. It should be noted that M(x) as used in their 
paper, referred to a much more general class of functions of which the 
continuous probability distribution has been proven to be a subclass. 
Blum (2), using two additional assumptions on M(x), proved that x, 
will converge to @ with probability 1. 

The main questions associated with this method are the speed of 
convergence and the choice of the coefficients, a,, to maximize the 
speed. Chung (3) attacked this problem by studying the asymptotic 
behavior of the moments of X, and was able to prove asymptotic normality 
under certain conditions. In particular, theorem 9 (3) with a, = c/n, 
then nif2 6.65 - © ) tends to the normal distribution N [o, © 202/ (20 4 c-1) 
in probability, where ©, = M'( ©@)> 0 and CO 2 is the variance of 


Y¥( © ). Thus the asymptotic variance of X, tends to zero with the speed 


1/n. But among the assumptions in this theorem Chung notes that 


lj M(x)/ 0 
TBs oo [ x)/x} > 





=> 


is very restricted from a statistical point of view since M( x ) is 
not, in this case, a bounded function and therefore not a distribution. 
Chung studies another case, theorem 6 (3), in which M( x ) is 
bounded, where the coefficients are a, = 1/n GE) | where E must exceed 
a positive number 1/2(1+K,) and where K, depends on the problem. He 
shows that n{l-E)/2(x, - @ ) will have a normal limit so that the 


(L-E) But here also it 


variance of X, tends to zero with the speed 1/n 
is pointed out that this speed is not statistically practical. 


In an effort to resolve the above problem Hodges and Lehmann (4), 


using theorem 1 (2), prove that theorem 9 (3) remains valid when the 





condition ——_ | Mx) ix | > 0 


is removed thus allowing the result to be used when M( x ) is bounded, 
e.g., continuous distribution functions are therefore allowable. In 
particular, they recommend the use of a, owe c/n and suggest that c 


be chosen so as to minimize the asymptotic normal variance 


oe 2/(2 e¢ port). It can be shown that c = 1/ om, , is a minimizing 
value of this variance and will reduce it to the form ( JQ / xX, 2, 
However, it is pointed out that in practice o<, , is not itself known 
and must be guessed. Since it is assumed that M'( @ ) = & , > 0 : 
it is suggested that a "safe'' small a priori estimate for Xs be used 


to make c > 1/2 «x, ; 





Cc. Methods of Attack 

1. Introduction 

Any practical application of the Robbins-Monro process, i.e., 

to obtain a reasonably accurate approximation of S with a limited 
sample size, forces us to make two assumptions which are not impossible 
to fulfill in areas where use of the process might be considered. The 
first is that the experimenter be capable of drawing large sample sizes, 
n >> 200. The second is that enough previous data including, possibly, 
past parametric results should be available to enable the experimenter to 


make an initial guess, x , which is reasonably close to the true 


Oo 
point. In most of the methods of attack considered in this paper it will 
be tacitly assumed the above conditions are obtainable. 

The authors will also follow the recommendation in (4) and use the 
coefficients a, = c/n, where c is a constant with the addition of a 


case where c is a function of n and, of course, equation (2) must be 


satisfied. Therefore, we are now employing the recursion relation, 


(4) x, = ec a [ vo) = «|, n2il 


where Y( x, ) is defined as in equation (1). Also we are restricting 
our attention to large values of @ since the character of the problem 
and the results may be complemented to apply to the case of small values 
of XC , 

The mathematical analysis of this problem is extremely difficult 
and for this reason our mode of attack is via computer simulation in the 
hope that some practical quantitative information can be obtained to 


serve as a guide for future analyses. 


—_— —_—_—— ~~ ee |= ea 


Me ~—o 


oe -~— 4 ” Si 





The first difficulty to emerge is that, for large c< , movement 
of the approximation, X, » to the right far outbalances movements to 
the left, i.e., from equation (4) movement is either c/n(C¢ ) or 
- c/n(1l- << ) to the right or left, respectively. As a result even 
moderate deviations from the expected success-failure pattern may drive 
the approximation, x, , so far from © that recovery becomes increas- 
ingly difficult as n grows large. The likelihood of this occurrence can 
be diminished by using either a "moderate" value of c or a deliberately 
high value for the guess, x, . 

The effect of the choice of c is discussed in sub-section 2 while 

the use of a deliberately large Xo is treated in sub-section 3. The 
use of a priori barriers on the x, is considered in sub-section 4. 
A comparison of sample variances with the theoretical asymptotic variance 
is made in sub-section 5 and a block averaging technique is taken up in 
sub-section 6. An endeavor, which we call a pseudo confidence interval, 
to obtain a stopping rule utilizing the success-failure pattern of the 
process is outlined in sub-section 7. 

2. The Value of c 

As noted in the introduction, too large a value of c leads to 
great oscillation in the fxg while too small a value results in 
insufficient movement. Due to the impossibility of finding an optimal c, 
which would depend upon the actual distribution and the accuracy of the 
guess both of which are unknown, we have left its determination up to 
the overall results of the computational analysis. Only a cursory pre- 
liminary investigation was made in an effort to find a "range of c" for 
the two diverse distributions, the standard normal and the uniform on 


the interval (0,1). That is, assuming knowledge that those were the 





actual distributions of the critical stimulus levels, equation (4) was 


utilized for various values of x and c while, at each stage, 


O 
comparing the value of the current a with the true 8 point and 
stopping if the difference between the two was less than or equal to 


some small value. Overall results are listed in Appendix 1 while, for 


the reader's immediate information, a summary of the "best" 


ici e, bon - et 0.001 )e for various Xo is given here. 
u(0,1) Cc Xo ml m¢ 
0.6-1.0 Ons: 25-150 5, 
0.7-1.0 0.8 25-100 296 
O.9=1.0 O28 50-100 oo7 
0.8-1.0 0.8 25-100 Behe: 
0.5-1.0 Oey 25250 oNs 
Oe Roles @) Oey Zo 95 
ALS 0.6 20 oh) 
N(O,1) Cc a n 
e< =0.95 b= 3. 0 ee, 50-400 
1.7-6.5 oe 20-150 
1.8-4.5 Lio 10-350 
1.8-4.0 20 10-250 
329-6.) Le 23-150 


The values of n above are approximately the range for the various 


values of c at which ( x, - OF ) S& 0.001. 





The accuracy of the guess x, 4=seems to be the most sensitive 
parameter, i.e., for close guesses moderate values of c seem to be best 
while for more distant guesses larger values of c work best. Further, 
with a distribution such as the normal overall results seem to be heavily 
dependent upon which side of S the guess is located. 

The asymptotic analysis of Chung (3) seems to indicate the use of a 
larger value for c than is recommended by our results. However, it must 
be emphasized that Chung was studying theory which would hold uniformly 
and, hence, for very distant guesses; and that a priori information and 
limited sample size was not a consideration. 

3. The High Guess 

With the observation about the amount of movement of the 
sequence given in the introduction it seems that a high guess would 
"most probably" be better than a low one. Our results seem to indicate 
that it is better. 

Therefore, a deliberately high guess could be used to lessen the 
likelihood that the "best" guess is below the O point. However, this 
should be done cautiously. For example, using the N(0,1) as the test 
distribution and e€ = 0.95 with c = 2.0-3.0, any high guess up to about 
2.0 will generally result in a sequence that arrives in the neighborhood 
of the true bo point in about 200 iterations. However, with guesses 
above this point moderate values of c are overwhelmed by n_ so quickly 
that x, seldom gets close to © . 

As cf increases,the value of a high guess is even more important. 
Data seems to indicate that for a low guess, with COe= 009 the 
result after n iterations will be farther away from © than would a 


correspondingly low guess with a smaller @&€ . 





Any decision to increase a "best'' guess by some increment before 
starting the process should be made using past data and results as a 
basis. 

4. A Priori Static Barriers 

Since the risks involved with high or low guesses have been 
pointed out in previous sub-sections, we feel it only necessary to recall 
the first two assumptions made in the introduction to this section to 
proceed with a discussion of this method. 

Given these assumptions, it seems reasonable to assume an experi- 
menter who, confident in the accuracy of his initial guess, may also be 
confident enough to place a priori limits around this guess beyond which 
the x, cannot travel. The reason for wanting limits on x, is 
obviously to avoid the risks pointed out earlier. Further, the existence 
of limits tends to make the choice of a value for c much less critical. 

Since this was not the only method studied,only one form of this 
method was programmed. Barriers were set symmetrically about our best 
guess and whenever x, was either greater than the upper bound or less 
than the lower bound it was set equal to x,_j. The iterations were 
allowed to continue but n was regarded as having been increased and a 
new Y( x, ) was used, i.e., a new success-failure comparison was made 
using a new random variable. In other words, given an a priori interval, 
the experimenter simply disregards certain data points if they violate 
this interval. Thus, the barriers possess a reflecting quality and the 
number of these reflections are counted. The results are listed in 
Appendix 2 and the method seems to be promising given | is within the 


barriers. 





It is obvious that the x, would tend to reflect on the lower 
barrier more than on the upper one even if a is contained within the 
interval. However, if fm is not in the interval, one expects the 
sequence to reflect upon one of the barriers an "abnormal" number of 
times and/or the x, to be driven close to this barrier. It should 
be possible to develop a rule for statistically testing this contingency. 
However, in this case, there is a difficulty in that the cause of in- 
ordinate numbers of reflections may be the result of too large a value 
Src. 

The values of c used cover a fairly wide range from too low a value, 
when no reflections occurred, to high values, when one wastes samples 
reflecting back and forth too often. These reflections will obviously 
occur in the first few iterations. But this is exactly the reason the 
experimenter set the bounds. 

It is still important that c should be within some "good" range of 
values since the investigator wants the process to settle down quickly 
yet possess enough reflections so that, after all computation is completed, 
their number and the final value of x, can give him some indication of 
the accuracy of his guess and/or the validity of his a priori interval. 
We recommend a value of c be calculated only after the length of the 
interval has been set and should be of such a magnitude as to allow one 
or two reflections on the upper barrier in the first few iterations given 
a low probability failure occurs. 

It should be noted that if the investigator has indications that the 
interval was not correct he can simply widen the barriers and, if desired, 
shift his guess and recompute the x,. If the barriers were simply 


widened with no concomitant shifting of the guess the x, computed before 


10 





the reflection are still valid and the process can be started over at 
this point. 

There are many variations of this method. Another variation which 
might be empirically feasible, is some sort of "squeeze down" method in 
which just enough iterations are made to allow the process to settle down, 
i.e., where n is so large that reflections cannot possibly occur. Then 
the experimenter can stop and, after noting the number of reflections on 
either barrier and his current value of x,, decide to continue the process 
or shift the barriers closer about his current x,, using it as his new 
"best guess'', recomputing a new value for c and starting the process 
again, i.e., with n = 1 but with different random samples. This is 
admittedly dangerous since it may be argued that even if O were within 
the interval at the beginning of the process that, when the barriers are 
shifted around x, and are made closer by some incremental value, one is 
assuming that: (a) the x, is actually closer to © than was Xo and 
(b) the investigator is in possession of enough knowledge that he will 
not reduce the length of the interval to such an extent that © will be 
outside the interval. 

At any rate, the authors believe the general method is advantageous 
by lessening the harmful effect of large oscillations and/or low guesses 


and should be used if possible. 


i 





5. Comparison of Sample Variance with Asymptotic Variance 


In an attempt to compare the behavior of the process for 
moderately large sample sizes with the theoretical asymptotic variance ., 
the process was carried out to xX399 twenty five times. For each Xp 


25-sample means and variances were computed. According to the asymptotic 


theory, yn *n SS 3) a ACO. oc ) and one might 

2 «Ky -l 
expect the sequence of 25-sample variances of x, eventually to behave 
like 1 Ge ‘4 (although it has not been proven that this is 


n 2 xX ,-l 
necessarily the case). The attempted comparisons were very disappointing. 


A data analysis showed the sample variance to be decreasing as the 


logarithm of the n th trial. A least squares fit of s* vs In (n ) 
was computed, where 
m 
s* = 1/m a, (x.. - X. )2 
fr 
m 
‘= 1/m a Xj 
j=l 
aL = the ae trial 
m = number of replications of the experiment. 


The fit was very good to the 2nd decimal. The mean and variance are 
also highly insensitive to either Ss or c. The results were, 

92 = - Bln (i) + A where 

A a 3.9, B ~S 0.42 


It should be noted that the variance must be positive, 1n (i) becomes 


infinite with i and the intercept is in the neighborhood of i = 1200. 


iz 





These results and a comparison with the asymptotic variance appear 
in Appendix 3. Clearly, the process is not at all close to the asymptotic 
values and yet it has become relatively stable. 

6. Block Averaging 

The importance of a good initial guess and the inactivity of 
the sequence when n becomes large has been emphasized throughout this 
paper. The latter fact, in particular, suggests the following technique 
might be of value. 

Given X*, choose some "large'' value of c and some "suitable" block 
size, m. Experiment to find x, as usual and call this value Xa,1° 
Then use this value to initiate the sequence again; i.e., letting Xm ,1 
be the starting point, proceed to generate a second block of m values 
using a decreased value of c. Let % ,,2 denote the "new'' x, and 
average Xp) 1 and oe and obtain the initial "guess" for a third 
block of size m, namely Xm, 2° In general, c is reduced in each 


block and the initial value for the j + lth block is 


Xo,jtl ~ *n, j 
Iterate in this manner for k blocks so that km =n is the total available 
sample size. 

Two cases were simulated and the results appear in Appendix 4; one 
for a low and one for a high guess, c is halved before beginning each 
block, m = 20, k = 10. Another case using c = In 20 in the initial 
block while the c in later blocks utilized the logarithm of the value 
of c immediately preceding. Only positive values of the constant were 
allowed. 

This method seems to have some merit although three difficulties 
immediately observable are: (a) the value of c to be used initially, 


13 





(b) the way this value should be decreased, and (c) the determination of 
the size of each block. 

7. A Pseudo Confidence Interval Stopping Rule 

Assuming We start in the vicinity of © and the process does 
not "stray" too far, we may view each Y( xj) as a Bernoulli trial at 
level, approximately, @¢< . In this case we may place "confidence" 
limits ( P, P ) on & , i.e. CONF( P< o <= P )=1-C€, 
where P is the solution to the equation 
n 
aa alee ks n-k 
(5) g(P) = > Ga! os) 2-0 
k=xt+1 : 


and P is the solution to the equation 


sal 

nh 

(6) g(P) = > PX (1-pynk - O€ /2 = 0 
k 


== 
where n = the number of iterations 


the number of successes (responses) in n iterations 


x 
(1-@¢5) = "confidence level" to be achieved before the process is 


terminated. 


rt 


For the case n xa, 


and if x=0, 


P 


1 - (ee P=0. 
For all other cases the solution to equations (5) and (6) are obtained 
by use of the recursive formula 


(7) Pee ore PAE) AGG) 
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where the initial Lea are the previous solutions for P and P while the 
new solutions are obtained at that P._,, such that g( P Bea 0 
9 


(Giees, e( PQ) = 0.0001). By Stirling's approximation for Ne it can be 


shown that K(n,x) > e'(Pm) and, for equation (5), 


iq) =n [ (n-1)/2 Fi x(n-1-x) | 1/2 exp [1/12(n-1)] 


— 


for 1 = x & n-2, and K(n,x) =n for x = 0,n-1. 


For equation (6), 


KGeeoe— en [ (n-1)/2. 77 (x=1) (n-x) | ie exp [in2c-1)] 
bo 2 xen, andes yx) = nefenex — ine 
The nesting procedure developed in Appendix 5 may be used to evaluate 
the g(Pm). The calculations are simple if done on a digital computer 
where some language such as SCRAP is available. 

The sampling continues until the difference P-Pis less than or 
equal to some preassigned low number. The corresponding x, is taken as 
the final estimate of © . 

It ae reasonable to assume that, if we use the preceding 
"dymamic barrier'' technique, there may exist times when a failure causes 
P to go below eX or a success causes P to go above & . If all trials 
were taken in a sufficiently small vicinity of © then either of these 
contingencies can be interpreted as resulting from a relatively unlikely 
sample sequence. Thus we reverse the outcomes, i.e., in the former case 
we change the failure to a success or, in the latter, a success to a 
failure. In this way the inequality P& CV = Pis always maintained 


until the stopping point is reached. Of course, xy, P and P are re- 


calculated with the change. 
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There is a serious danger in such arbitrary changes of data as may 
be emphasized by considering an extreme case. Suppose X, is actually a 
long way above @ , say 10 standard deviations. The method will artific- 
ially change the proportion of successes so as to maintain P& & = P 
and the process is prevented from moving in its natural direction. Thus 
one must start close to @ and use a small value of c. "How close" and 
"how small ac" are critical questions. 

The results of simulation using the regular normal distribution as 
the sample set appear in 2 groups. The first is a collection of pilot 
runs in which the simulations were terminated for various programming 
reasons not related to the stopping rule. However, the information gained 
was felt to be of value and is, therefore, included in Appendix 6A. 

The second set is composed of cases where longer running times were 
allowed in an effort to analyze the movement of the process and the effect 
of the stopping rule. In all of these cases X%) = 2,™% = 0.95, © = 1.645 
and the analyses of the results are given in Appendix 6B. The system 
never reached its programmed stopping point, eo, 1D SP 7002 -) bor 
the several values of c listed it appears that the reversal of outcomes 
was undesirable in the fashion mentioned previously. That is, our initial 
value for x, (0.35 standard deviations above @ ) was too far away and, 
in each case a high number of successes, especially near the beginning of 
the process, caused P to attempt to go above“ . In these cases, 
arbitrarily changing the outcomes and recomputing the Xn drove the process 
farther away from © . However, it can easily be seen that, being above © , 
if an inordinate number of low probability failures had occurred, an 
attempt by P to go below © would have aided convergence of the xX, when 
the failure was changed to a success. 


It will be noted that c = ln n was utilized in a few cases. This 
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function for c seems to be of help in certain cases and a proof of the 


po 00 
convergence of > att = pean n/n)* is given in Appendix 7. 
| | 


Wy 


lbhe Conclusions 

1. The authors believe that the asymptotic methods suggested in 
the various references while theoretically attractive have little 
practical value since, as seen from the overall results, the xn do not 
change significantly enough when n grows large to be of statistical value. 
In other words the process will normally settle on one side or the other 
by at least 100< n< 200 while further movement is too insignificant 
to warrant more samples. 

2. The method of a priori barriers on the guess seems to hold 
significant practical promise. The barriers do keep the x, "close" to 
the true © point and reduce sample size for the same accuracy of a 
given the interval contains the © point. 

3. It is felt that the accuracy of the initial guess, x5, is the 
most critical parameter. This is true in all cases, no matter which 
method is considered, but especially, of course, if barriers are desired. 

4. The value of a guess which is "slightly" high cannot be over 
emphasized, especially when &%& is large. The higher the value of &< the 
more concerned the experimenter should be to start the process above ©) 
since it is that much more probable that any guess will be driven downward 
initially when the activity of the process is at its highest level. 

5. The attempt to obtain a stopping rule by use of the pseudo 
confidence interval methods, while failing in a practical sense, never- 
theless gives an experimenter one bit of information within 100< n < 200 
samples. We think it is very important, for reasons previously outlined, 
to know which side of @ the xp are located. With the pseudo confidence 
interval method this bit of information is obtained, within a practical 


sample size, whenever P tries to go below or P to go above % . In the 
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first case the investigator will realize the x, is probably below © =; 
in the second, it is probably above O . We believe this method, if used 
with another method which improves convergence, can be used as a rule to 


"closeness" 


"stop when the first reflection occurs" even though no idea of 
is obtained. We also believe that these "violations" of “© can be 
obtained earlier in the process if a larger value of “X< 9 is used. 

In other words, the method of a priori static barriers discussed in 
subsection 4, Section C, seems to help significantly given sufficient 
reason for their placement. The pseudo confidence interval method could 
give this reason for if, after n has reached a sufficiently high value, 
P tried to go above ©< the experimentation could cease and an upper 
bound could be placed either at the current Xp point or slightly above 
it and the process could be re-started, i.e., X, = Xo, drawing further 
samples. Thus the investigator has some "confidence" for placing this 
bound on the } Xn { . In a similar fashion a lower bound could be 
obtained when P tried to go below Cc. 

It is obvious that a method such as this would require quite large 
sample sizes but they may still be an order of magnitude or so less than 
that which seems to be required of the unaltered Robbins-Monro technique. 

In any case, lacking further theoretical methods for accelerating 
the convergence of the Robbins-Monro process, methods similar to these 
seem to be worthy of further study. 

6. The block averaging technique holds some promise but the problems 
involving the choice of an initial c, the method of decreasing it and a 
"suitable" block size are apparent and need to be studied. Also, other 


versions of the idea deserve consideration. So far the method seems to 


he 





to have the effect of stabilizing the x, in the vicinity of @ 
We feel this is the best method where large sample sizes are not 
available and the experimenter is lacking prior data. 

7. Except, possibly, for a technique such as block averaging, the 
value of c does not seem to be as critical as it might appear to be at 
first glance. The experimenter should, of course, know the approximate 
magnitudes of the variables near level o%~ and can adjust c accordingly. 
Depending upon these magnitudes, "moderate" values for c should be 
chosen, assuming a good guess, to keep x, from straying too far away 
hicom = ‘on the £irst few Lterarions. 

As noted previously, for the N(0,1), 1 < c < 5 seems to be 
the best range for c when the guesses are within 0.5 of the true © point. 
We feel justified in assuming this would also be the case for a large 
class of distributions possessing tails within a fairly wide region 
about that of the N(O,1). For more general distributions, it seems 
appropriate to measure c in more general units, say standard deviations 


q or reciprocal slope VL, 
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E= Recommendations for Further Study 

1. The value of information obtainable by using a method similar 
to the pseudo confidence interval has been mentioned in conclusion 5. It 
is believed they are worthy of further study not only computationally, 
in an effort to obtain knowledge of the approximate sample sizes required 
to set bounds on the {xp} for different distributions, but analytically, in 
an effort to establish the degree of confidence one has in placing the 
bounds when using this method. 

In other words, some hypothesis testing scheme might be studied 
which would test the hypothesis that, when a bound is set on the i xp} A 
it is on the correct side of ©. 

2. Some method for damping the effect of a series of undesirable 
events, given the guess, x5, was close to ic. in order to prevent {xn} 
from straying too far away from © might be studied. 

A particular method in this class is the Casaro sum method of 
averaging. That is, it is known that if a sequence {Xn | + then 
the sequence of averages fxnf>]e where xn = 1/n a x; . This 

Az! 
technique is used to stabilize oscillating sequences and may result in 
accelerating the convergence for "close" guesses. 

3. Kesten (6) develops a method for accelerating convergence of 
the process based on the number of times the difference of the x, changes 
in sign. That is, he reasons that the closer to © the xp gets the more 
frequently the sign of their difference should change. This method 
should be studied further from a computational point of view and it may 
prove helpful given the x, actually are "close enough" to @ . 

4. For the interested reader the various methods discussed in this 


paper and those recommended above should, of course, be studied using 


21 





widely differing distributions to determine the class of distributions 
in which these methods could be practically used in accelerating 


convergence. 
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APPENDIX 


Investigating the range of c for 


1 


two distributions by a comparison 


of the x, with the true © point of the distribution, a maximum of 500 


iterations was allowed where L is the 
Ix, --@} = L for a cessation of the 
particular c means convergence of the 
not obtained in 500 iterations. More 
computation was carried out using the 


formation but with a different set of 


U(O,1) 


( = 


1. L=0.001, & = 0.95: 
Xo c n 


0.9 37 
24 

31 
67,19 
182 
27,68 
16,39 
16 

14 
101 
238 
12 


Digee: 


O26 


ODOULOMOdOasI A Ow 


© 
~ 
Soo oS Se | Oo oO oO HE a Se 


38 


Note: 

ey= 0.1(0.1)1.0 

only those cases which 
"converged" under 

n = 500 are 

listed 


24 


67 SF 


given comparison value, i.e., 
trials. Unlisted values for any 
Xn to within L of the @ point was 
than one value for n indicates the 
same distribution and initial in- 


random numbers from the distribution. 


@ ) 
Z L = ©.001, x, = 02. 
Cos Cc n 
0.96 0.5 49 
0.7 26 
0.8 16 
Leo 90 
0.97 0.9 DZ 
0.98 0.4 fi 
OF7 352 
0.6 is 
1.0 19 
0.99 none ~ 
a 0 a xn 0.8 
L c n 
0.002 O05 ZO 
Or. 16 
0.8 al 
10 78 
0.003 QO. 28 
0.8 246 
0.9 ats 
0.004 ORG 7 
Of? is 
ieee 36 
O2005 0.6 2 
O AY, Sy! 
1.0 > 





N(O,1) 
o = 0.95( © = 1.645) 


C Xo n 
1.6 1.0 Bx) 
lee? lie 16 
| Dee 2.0 39 
ere! i as. 7 
iS Pad) 3 
te 2a) 80 
1.95 2.0 23 
7 GN 1.0 415 
2255 2.0 10 
3.05 2.0 172 
2 55 1.5 147 
B55 eS 321 
4.05 in 144 
4.05 eas: 357 
4.05 20) 233 
4.55 42 78 
4.55 1.5 45 

4.55 DS 25 
E55) Do 115 
6.05 25 43 
6.55 ee 116 
6.55 26.5 139 


Mete: c = 072 = 6.55, x5 = 1.0(07b)2:5 
only those cases which "converged" under 


n = 500 are listed. 
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APPENDIX 2 


o& = 0.95, Bo ee ais 
200 iterations allowed 
L = length of allowed interval = 0.4 
A = lower barrier, B = upper barrier 
Lower Upper 
Xo A B @ X200 Reflections Reflections 
2 Lit eS, 5) ima03. 20 il 
ie: 2 We 0 5 lea7 94 18 6 
3 dlgell les, ) 1.4076 19 1 (at x9) 
3 Lisi ses, 10 E36 20 5 
ee Lee. 6 5 T5733 16 5 
1 72 0 10 iS 20 6 
1s, es | Oa | 5 G7 25 9 3 
lees Ea ed 10 1.63 Ly 7 
6 IIa ars) 5) ro 1) 1 
6 Ih 2) 1 ae: LG > 20 i 
1D | Ps, Woes. 2 127459 Z 0 
7 ie 9 5 Ih Syston) Uy 0 
lef i Is 10 67 78 0 
are: 1.6 20 > Laea2y a5 2 
rao 6 2.0 10 Ip, ois! 78 4 
a) ly Ze. 5 T7061 14 0 
1S ee | Ze. 10 e780, 49 2 
Lower Upper 
Xo L A B e X50 Reflections Reflections 
4 0.8 ALG Lad 2 Pole S12 0 il 
Ia 2G O29 Mes) Z 1.6943 0 0 
eee 0.6 eo 20 Z be 7 085 0 0 
Js: 0.6 es, 28 Z 1 55,0, eZ 0 
3 0.8 eet Zee Z T7026 0 0 
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APPENDIX 3 


Sample mean and sample variance of m repetitions of the process 


™ m 
aa ey , 2 y Nie 
xj = iL at Ss = iL (x5 j — x; ) 
m m 
i=l rel 


2 


; falls off as the logarithm of i 


Note that s 
Distribution curve is 


Normal (0,1) 


A= .95 
© = 1.645 
X = pL ES 
c =~ Ome 
m = 25 
_ iy et 
e — 10 A Xn Si 1 2x, -'! 
25 2.167 1,410 4.44 
50 22137 1.081 Zee 
75 2.060 0.972 1.49 
100 22032 0.861 gee 
2s Ze0i2 0.814 -980 
150 20. 0.747 .749 
175 2.009 0.699 7OoD 
200 2.006 0.659 2550 
229 1.996 0.626 -492 
250 1.989 0.602 ~486 
275 1.991 0.569 -402 
300 1.978 0.533 .362 


ZY 





APPENDIX 4 
Each block is 20 iterations. The current block average provides the 
"suess'" for the next block. The constant c is halved at the beginning 


of each block. 








N(O,1) 

xX = 0.950 

C-—le Gas 
Xo n Cc Sen *ave 
Ate: 20 Di) Laz EeZis 
40 Zee I roy ay? 
60 Laze lig store: L265 
80 On625 1.750 1.676 
100 O53 25 Ue i Par 1.686 
ZO 0.1563 1709 1.689 
140 O07 ei 1.709 Wg oS 
160 0.0391 L707 1.694 
180 07015 1.706 1.695 
200 0.0098 ies. 705 1.696 
ee: 20 10 hes ZAG ee Onl: 
40 D0 23008 1.637 
60 Za I 631. 1.868 
80 eZ re Ae) 1.829 
100 O2625 65 1.794 
L210) 0.3825 LeolF 17.05 
140 G21305 lee? Ib, aes 
160 O207om a0 Pe oU 
180 0.0391 e625 Lee7ih9 
200 O01 Ss LOA 1.709 

ee- |l, 20, Im In 20, In In In 20, ete. 
Ss I e x Xave 
lees) 20 Pape) 3) fl 12036 1.636 
40 Loo 2 1.638 L637 
60 0.0928 e350 O33 
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APPENDIX 5 
Nesting the Binomial 


For computational purposes the binomial sum may be nested by 


n 
Cc) poq?st = 5" (lea eee og (1 + nee 
oa . D (Es 7omeP 
1=x 
oo oll ap Se p@leeste actell® qd), ce) 
n-(xtl) p n-x Pp 


=p (1 + még) + n=l q (+ ne2eg GaGa itl gq (ec ee) 
P “ae P 


p Ba Nn-j p n-(j-L) P 


Sere | = ketene... «yo. 
Let A; = j q = 
: n-(j-1) P 


n 
n tae 
> Oa Pept. + ...01 + Ago (L + Aaj) (1 + A;)..4) 
i=x 
Let By = 1 and Bret] = 1+ AB 5 Ke = RS eel eet 
where n-x+tl is the number of terms in the series expansion of the 


binomial sum. 
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p” 


+ 


xt+1 


Avvo 


By 


Biextl = 


nx 


reece) Ce 
n-(x+1) p 


ya Canta ss 


i=x 
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APPENDIX 6A 
Sample data using pseudo confidence intervals 
F(x) is N(0,1) , 1 = eee 0 


n 


Sample size 


x, = Terminal value after N iterations 
P = Upper "confidence" bound 
P = Lower "confidence" bound 


Reflections high: P< 


Reflections low: P 7 
X= .95 X, = 1.4 
s Reflections 
Cc n Xa Poses High Low 
20) 76 1.344 .030 0 0 
20) 143 1.446 050 0 7 
2.0 175 1.450 054 0 5 
&= .95 x, = 1.5 
7 Reflections 
Cc n xy P-P High Low 
Zee 84 12436 IES: 0 0 
0 84 1,460 229 0 0 
n(n) 88 Meas: Ue)s 0 0 
In(nt1) 94 je 2/2 .098 0 0 
2.0 176 1.500 i062 0 4 
In(n+1) 220 1.691 038 0 0 
Ces 5 Xml 
_ Reflections 
c n a Pi aAP High Low 
126 84 I508 029 0 0 
220 143 og? .050 0 6 
PA 176 1.600 062 0 4 
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— 
oO ow’ © OO © © O00 © 


mMNrqtN NF Oh LP fo 


ln(nt+l 


Q 
i 


[oo ae, 


116 
143 
176 


nel 


28 
102 
eZ iL 
P22 
2 
167 
200 
Sey 
425 
210 
oie 


ye) 


156 
248 


ead 

Xn ip 2 
esos, .096 
1.644 110 
1.446 050 
X, = 1.8 

x Da 
1.688 ollie 
Iye/07 050 
x, = bee 

Xp P-2P 
L677 094 
1.689 .050 
1.692 062 
ee ee) 

xy P =a 
1.607 .098 
1.808 .049 
ei oe .098 
1.790 Os 
2208 089 
TOOL .050 
be7eZ J052 
1.686 049 
Lea96 087 
1.740 .048 
Lisi Oe .044 
ey = 2.2 

%y P-P 
1.864 .015 
2 AU MU 


a2 


Reflections 


High Low 
0 0 
0 0 
0 0 
Reflections 
High Low 
0 0 
0 3 
Reflections 
High Low 
0 0 
0 2 
0 0 
Reflections 
High Low 
0 0 
0 i 
0 0 
0 1 
0 0 
0 0 
0 0 
0 i 
0 0 
0 0 
0 0 
Reflections 
High Low 
0 0 
0 l 





APPENDIX 6B 


of = .95, G= 1.645 








x, (= 250,, (Someta, L = eae 

Successes Failures nay Pp P 
C=1i: 2) 1.76855 16 -94966 

Py tried toeeo ten changed success to failure 
1* 1.78493 99913 592213 
24 1.76774 pede 2) SM) .94343 
iL e/a 99570 .92611 
28 1.76473 99679 -94435 
i evo | 99274 Re 6 725) 
8 I FA TET s9oa27 25060 
i LL FLAT eS 98877 .92630 
58 goo LL 799750 94998 

P tried to go high 

1* 1.76339 .98930 94306 
12 Le OO 7 98996 94622 
Z Is ales) 98346 93367 


little change in numbers 
total # times P tried to go high = 4 
X1105 = 175674 .98470 94753 
ee 03717 


Note: * = deliberately changed actual experimental result 








C=2: Successes Failures ne P ne 
48 1.55412 11a) 93950 
1 1.59290 99895 90685 
33 1.54181 299937 94343 
1 1.56471 29570 92611 
28 ISS 6)2) 7/8) .99679 ~94435 
I leo Z 75 99274 A) S722 
8 1.54588 99322 93680 
1 1.56159 98877 92630 

little change no reflections recorded 

Pein aine: X71 qH3 = 1.57895 .98650 ~93236 
PeP = .05414 
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i> Min: 





Successes Failures x 

4 1.68750 

if Ze257150 

7 Pao 2 

if Ze ooay S 

68 2.08410 

1 Z ANB S5 

69 2.02768 

P tried to go high so changed 

1* 2.04643 

27 ZeOz 198 
P tried to go high 

1* 27 2 


ro} 


EG 


p90779 
Bish) Sy, 
.97196 
bps 5/3) Os, 
-99011 
99464 


99107 
52510) 


98930 


success to failure 


£ 


-47287 
~34272 
soo 2) 
99001 


~92429 


-90866 
94983 


.94133 


.94998 


94306 


little change in numbers, 15 more bounces recorded where 
tried to go high and had to change success to failure 


total # reflections recorded = 17 


X1135 = IE pero. -98542 
P-P = 
4 1.47917 0 
1 2.42917 ~98979 
7 2.22420 99573 
1 7550998 .97196 
68 2.14016 99559 
1 2.19809 oS NOL LIE 
69 2.04614 99464 
P tried to go high, changed success to failure 
1* 2-07739 7 oO7 
little change in numbers 
total # times P tried to high = 15 
X1077 1.99561 98762 
P-P = 


34 


94897 


03645 


47287 
34272 
.66121 
.29001 
92429 


90866 


94983 


94133 


94954 


.03808 


|r 





Successes Failures — 
C=ln(n+l): 17 1373875 .O 
ik 1.89415 99715 
yl 1.72470 99695 
ih 1.79941 99285 
ZA Ley ZO 99497 
il 1.78452 98875 
4 Lead 98933 
1 1.82666 98207 
iy 1.80761 .98417 
1 eo2739 97768 
9 1.83483 .98053 
1 1.88047 .97226 
18 1.84028 97741 
1 1.87982 .97404 
little change, P tried to go high 1 time 
x1087 = 1.75673 98739 
Poe 
C=6: 4 1.37500 0 
1 2.91500 .98980 
7 2.26904 99573 
1 2270750 a7 207 
68 2 loo 19 29557 
1 223770 99014 
70 7a 05) o\5)8, .99463 
P tried to go high changed success to failure 
1* 2.09065 mI91LoS 
26 2.04371 .99248 
P tried to go high 
1* 2.07557 98934 
26 2.03500 -99053 
P tried to go high 
1* 2.06254 .98805 
little change in numbers 
total # times P tried to go high = 13 
X1000 = 1.99340 98436 
P-P 
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Beh | 





P 





- 83843 
76243 
90677 
87980 
91394 
89629 
90167 
»88542 
. 89486 
.88128 
»89242 
.87934 
.89860 
88882 


-94769 
03770 
47287 
~ 34390 
.66018 
29087) 
92406 
90889 
95000 


94200 
94986 


.94319 
94988 


94441 


-94939 


.03497 





Successes Failures 
C=1 57 
P triedeto so) high 
ie 
24 
1 
28 
if 
8 
1 
58 
P tried to go high 


I ecas 


Xn 





_— 


el eel el el el ee 


by 


. 76855 


78493 
.76774 
nyo l9 
5/0473 
SiO Zia 
SOUT 
211908 
o/ JO 


76239 


total #times P tried to go high 


*1000 


La 


36 


75490 


i 


c 


399g IS 
99057 


-7957h 


w95678 
ee adi Td 
99321 
98550 
99248 


.98934 


.98436 


i 


94966 


~92240 
94321 
92035 
.94417 
SoZ 70 
93664 
92648 
94986 


94319 
.94939 


03497 





APPENDIX 7 


Proof of the convergence of (ln n)¢ 


n 
Let aa = K log, gn 
n 
10m! 1omtl 
ee a 
m m n@ 
ig m=o n=10° +1 m=0 n=10 +1 


For 10™+1 Sin SO we have 


log’n S (log jomtl, 2 = (mes 


lomtl 10m m+1 
also Dt lies dx = (- ee = 9 
1 =” ce 1o™ 10" 
10™ 
ane 2 2 2 2 
ye an — K = 9 (m1 ) < CO 
One 


n=l m=O 


since it is dominated by a geometric series. 
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